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ABSTRACT 



Context. Investigations of mass segregation are of vital interest for the understanding of the formation and dynamical evolution of stel- 
lar systems on a wide range of spatial scales. A consistent analysis requires a robust measure among different objects and well-defined 
comparison with theoretical expectations. Various methods have been used for this purpose but usually with limited significance, 
quantifiability, and application to both simulations and observations. 

Aims. We aim at developing a measure of mass segregation with as few parameters as possible, robustness against peculiar configura- 
tions, independence of mass determination, simple implementation, stable algorithm, and that is equally well adoptable for data from 
either simulations or observations. 

Methods. Our method is based on the minimum spanning tree (MST) that serves as a geometry-independent measure of concentration. 
Compared to previous such approaches we obtain a significant refinement by using the geometrical mean as an intermediate-pass. 
Results. The geometrical mean boosts the sensitivity compared to previous applications of the MST. It thus allows the detection of 
mass segregation with much higher confidence and for much lower degrees of mass segregation than other approaches. The method 
shows in particular very clear signatures even when applied to small subsets of the entire population. We confirm with high signifi- 
cance strong mass segregation of the five most massive stars in the Orion Nebula Cluster (ONC). 

Conclusions. Our method is the most sensitive general measure of mass segregation so far and provides robust results for both data 
from simulations and observations. As such it is ideally suited for tracking mass segregation in young star clusters and to investigate 
the long standing paradigm of primordial mass segregation by comparison of simulations and observations. 
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1. Introduction 

It is commonly accepted that star formation does usually not oc- 
cur in isolation but that a large majority of young stars - up to 
90% - are part of a cluster (Lada & Lada 20031 Evans et al. 
2009). The dynamical evolution of a star cluster leaves a vari- 
ety of imprints in the phase space of its stellar population which 
are good tracers of the dynamical age of the cluster. This quan- 
tity is in particular interesting when compared to the physical 
age. A higher dynamical than physical age means that observ- 
able dynamical imprints did not have enough time to evolve dy- 
namically and thus must have been present - at least partially - 
already at the beginning. This is usually known as primordial 
origin. 

One of the most widely discussed aspects of the dynamical 
evolution of young star clusters is that of mass segregation. From 
theoretical work it is well known that this process is inevitably 
entangled with the dynamical evolution of a self-gravitating sys 
tem of at least two different mass components (Spitzer 1969 



Farouki et al.||1983[ |Spurzem & T akahashi jp95||Khalisi eTal 
2007} . Due to energy equipartition - hence via two-body en- 
counters - the more massive particles tend to settle towards the 
cluster centre over time while the lower-mass particles are pref- 
erentially pushed to the outer parts. However, it is a much more 



challenging task to identify mass segregation observationally in 
real objects than theoretically from 'clean' numerical simula- 
tions. This is even more severe for young star clusters that are 
usually still embedded in their natal gas and the dynamical and 
physical age of which is much more difficult to estimate. 

However, the investigation of mass segregation in these 
young stellar systems is of particular interest for a deeper un- 
derstanding of the star formation process. Three fundamental 
questions are part of the scientific discussion in this context: (i) 
Do young star clusters (really) show mass segregation? (ii) What 
is the observed degree of mass segregation? (iii) Could the ob- 
served degree of mass segregation have developed dynamically 
or can it be only explained by primordial origin? 

An investigation of these important aspects of the star forma- 
tion process requires a solid tool that is at best independent of the 
method used to determine the stellar masses and independent of 
the geometry of the object, that provides an unambiguous mea- 
sure and is equally well applicable to observational and numer- 
ical data. Note that also dynamical models have the equivalent 
problem to identify mass segregation clearly and quantitatively. 

So far mainly four measures have been used to investigate 
mass segregation in (young) star clusters: i) the slope of the (dif- 
ferential) mass fu nction in different annuli around the cluster 
centre (M^ F ; e.g. |Richer et al.|l988| |Bolte|l989| |Hillenbrand 
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|1997| l, ii) the slope of the cumulative mass function in different 
annuli around the cluster centre (A1 MF 



e.g. 



Pandey et al. 



1992 



Hillenbra nd & Hartmann 1998), iii) the characteristi c radius o 
different mass-groups of stars (Af* ; e.g 



Farouki et al. 
and iv) the length of the minimum spann ing tree (MST) 



1983i, 
of dit- 



Allison et al. 



2009b 



a- 

iiie 



ferent mass-groups (vV( MST ; developed by i 

Most of these methods suffer from several weaknesses. Trie 
first three, vVt MF , >M MF , and At?, implicitly assume a spherical 
geometry and thus depend on the determination of some cluster 
centre. The first two of them introduce additional bias due to ra- 
dial binning and uncertainties in deriving the slope of the mass 
function. Furthermore, 
mass binning (see e.g. 



Wl^p suffers from uncertainties due to 



Stolte et al. 



2006, 



for a comparison of 
vVt MF and vV( MF applied to observational data). There is a fun- 
damental difference in the concept of the first and the last two 
methods: the former measure the mass distribution in different 
spatial volumes, the latter evaluate the spatial distribution of dif- 
ferent sets of most massive stars. Consequently, Wl^ and vV( MST 
do not require a direct measure of stellar masses but only a qual- 
itative criterion for correct ordering. This property is a huge ad- 
vantage in the face of observational data. However, M^ has also 
a significant weakness: the characteristic radius of a small sub- 
group is very sensitive to the definition of the cluster centre. 
Hence this method is not viable for a low degree of mass seg- 
regation, i.e. a signature from only a few most massive stars. 

In contrast, vVt MST does not show any of these disadvantages. 
However, it is also affected by quite low sensitivity that prevents 
unambiguous detection of weak mass segregation. To take ad- 
vantage of the potential strength of A1 MST we have developed 
an improved prescription for measuring mass segregation, in the 
following referred to as vV( MST , with significantly increased sen- 
sitivity. We present in this work our scheme and demonstrate its 
efficiency. 

In Section pi we describe our method vV( MST for measuring 
mass segregation and discuss a scheme for setting up initially 
mass segregated star cluster models. In Section[3]we discuss var- 
ious examples of the effectiveness of our scheme compared to 
previous methods and present a first numerical application. The 
conclusion and discussion mark the last section of this paper. 



2. Methods 

2.1. Geometrical minimum spanning tree T M st 

As a proxy for mass segregation we extend the method A1 MST 
developed by |Allison et al. (|2009b[l (see also Cartwright & 
|Whitworth||2004| |Schmeja & Klessen||2006| >. In summary, the 
authors use the minimum spanning tree (MST), the graph which 
connects all vertices within a given sample with the lowest pos- 
sible sum of edges and no closed loops ( |Gower & Ross 1969). 
The length of the MST, Zmst> is a measure of the concentration or 
compactness of a given sample of vertices and is unique whereas 
its shape does not need to be. Mass segregation of a stellar sys- 
tem of size N is quantified by comparing Z M st of the n most 
massive stars, Ect, with the average Zmst of k sets of n random 
cluster stars, < l^ ST >, and its standard deviation, A^ T . The 
distribution of /mst of the k random samples is indeed Gaussian. 
The ratios of these quantities, 
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1 A variant of the MST method has been presented just recently by 
Yu et al. 1201 1 ) at the time of submission of our publication. 



provide a quantitative measure of the degree of mass segrega- 
tion: the larger Amst the more concentrated are the massive stars 
compared to the reference sample; the associated standard devi- 
ation AAmst quantifies the significance of the result. To make 
this method work and comparable also with observational data 
all calculations are carried out in two dimensions, i.e. on a pro- 
jection of the set of vertices. 

We have chosen the number of random reference samples, k, 
such that a fraction p of the entire population of size N is covered 
independent of the sample size n: 



(N-nX 



r ln(l-p) 

'= r in(l-»/A0 L (2) 



where \x\ denotes the ceiling function 2 of x. The threshold has 
been set to p — 0.99. 

Our method vV( MST involves two crucial modifications of 
^mst mat ma ke h computationally much more effective and 



boost its sensitivity. First, unlike Allison et al. (2009b) we do not 



calculate the separations between all possible pairs of stars but 
determine the MST (in two dimensions) in a three-step proce- 
dure: first we use a 2D Delaunay triangulation (from the software 
package GEOMPACK: |Joe|1991[ ) to construct a useful graph of 
stellar positions projected onto a plane, then we sort the edges 
of the triangles in ascending order, and finally adapt Kruskal's 
algorithm (Kruskal 1956) with an efficient union-find-algorithm 
to construct the MST. 

A Delaunay triangulation has the very useful property that 
the MST construction (in any dimension but for Euclidean dis- 
tance) is its sub-graph (Preparata & Shamos 1985). The imple- 
mentation of [Joe ( 1991 1 involves a computational effort 



f DT ~tf(|£|.log|V|), (3) 

similar to the edge sorting by the quick-sort algorithm, 

f Q s~0(|£|-log|£|), (4) 

where \E\ is the number of edges and |V| the number of ver- 
tices. With two algorithmic "tricks" in the union-find algorithm 
("union by rank" and "path compression") we reduce its run- 
time to 

f UF ~tf(|£|-log*[V|), (5) 



where 



log*(n) = minji e N | log(log(. . . log(n) ...))< l) (6) 



and thus rather constant though in principle unlimited (Tarjan 



19791. So the total computational effort is dominated by the 



Delaunay triangulation and sorting of edges and thus scales as 

tut ~ 0(\V\ ■ log(|V|)) = 0{\E\ • log(|£|)) , (7) 

with (9(| V|) = 0{\E\). This is a significant improvement over the 
cost of the brute-force method, 0(\E\ 2 ). 

Second, we do not use directly the sum of the edges /mst as 
a measure yet their geometric mean 7mst, 



Tmst 



(ri \ 1/n 1 v< 

11 H = eX P nh ln 



(8) 



The ceiling function \x\ gives the lowest integer larger than or equal 



to x. 
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and its associated geometric standard deviation AyMST, 



Ay M ST = exp 



2" =1 (lne,-lny M ST) 



2^ 



(9) 



where e, are the n MST edges. Analogous to Amst, we obtain 
the new measure Fmst via a proper normalisation: 



_ ^MST Ar _ . re f 

MST - -^il . A1 MST - Ay MST . 
'MST 



(10) 



Note that the usage of a geometric mean involves a multiplicative 
standard deviation AF M st, i-e- the upper and lower \cr intervals 
are given by T M st ■ (AFmst)* 1 - 

The geometrical mean has two important properties that turn 
out to be very useful for our purpose of using the MST as a 
measure of mass segregation: 



1, 



The n-th root implicitly involves an intermediate-pass that 
damps contributions from extreme edge lengths very effec- 
tively (i.e. it gives a lower weight to very short or very long 
edges). Hence the mean edge length of a compact config- 
uration of even few stars will not be affected much by an 
"outlier". 

We demonstrate this property via a simple case of n— 1 edges 
with length / = <?o and one very short "outlier" representing 
a tight binary with / = e\ — eeo, where e <K 1 (ej and e 
could be also very large here in principle). Then the geomet- 
ric mean would yield 



I In 



< I >= (^ei) "" = el 



.1/n 



(ID 



In any practical case (i.e. realistic star clusters models, obser- 
vational data) the relevant binary separation will be at most 
3 orders of magnitude smaller than the mean separation of 
single stars (or binary centre-of-mass). 3 So with e - 1CT 3 
we obtain even for a very small sub-sample of n — 5 stars a 
mean edge length < / >^ l/4eo- This demonstrates the very 
effective damping of extreme values by the geometric mean. 
The product of edges has the valuable property that com- 
mon edges in the two samples of massive and reference stars, 
yJJJIj and y^ f ST , are cancelled out by normalisation, Eq. ( fTO) . 
So it is only the disjoint set of edge lengths that determines 
Tmst and hence the degree of mass segregation. Compared 
to AI^st our scheme vVl^ ST is thus much more robust when 
applied to stellar systems with a high binary fraction. 



2.2. Primordial mass-segregation 

A thorough test of our method At^ ST requires the generation 
of predefined mass segregate d stellar system s. For this purpose 



we found the method of Subr et al. ( 2008 1 to be very useful. 
The strength of the method is the generation of a well defined 
degree of mass segregation that is controlled via one single pa- 
rameter, S , and the creation of dynamically consistent models by 
modelling local potentials and velocities in a quasi-equilibrium 
state. The so-called index of mass segregation covers the range 
S G [0, 1), where 5=0 corresponds to an entirely unsegregated 



3 This estimate is based on the following reference values: the mean 
stellar separation in the Orion Nebula Cluster is about 2.5 pc/4000 1/3 ~ 
0.15 pc a 30000 A U; the HST has a resolution of -50 AU at an assumed 
distance of 400 pc JMenten et al.|2007[[JeffrTes|2007[|Kraus et al.|2009> . 
Thus, binaries with separations < 10 _J would remain unresolved. 




x [arbitrary length unit] 




x [arbitrary length unit] 

Fig. 1. Visualisation of the two-dimensional minimum spanning 
tree (MST) for numerical cluster models with different initial 



mass segregation parameters S (see Section 2.2 1, The solid blue 
lines (connecting the small black dots and large red dots) rep- 
resent the MST of the entire population. The red dashed lines 
(connecting the large red dots) represent the MST of the ten most 
massive particles. Top: S = 0.1. Bottom: S - 0.9. 



state, while the upper limit 5 = 1 marks the maximum possi- 
ble segregation. In Fig. [T] we show two initially mass segregated 
models of 100 particles with S — 0.1 and S = 0.9 and their 
corresponding MST, respectively. 

In summary, the authors find that a mean inter-particle po- 
tential of the form 



(c/y)=2(l-S) 2 (C/tot> 



TO; Iflj 

Ml 



(M> ,M\ 

sub sub 



v-S 



(12) 
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using ordered subsets of stars, 



ML 



■E 

7=1 



ni\ ^ m 2 ^ ■•■ ^ m N 



(13) 



provides one of the simplest forms that characterise locally con- 
sistent mass segregation. This equation provides a constraint on 
the distribution function from which one can construct a cluster 
with the desired degree of mass segregation S by adding one-by- 
one the individual stars from a mass-ordered set. The underlying 
distribution function 



«(r)<xr 2 (r 2 (M< ub ) + r 2 ) 



-5/2 



with 



r P (AC) 



1 



M> ^ 

sub 



3n GMi 



32 | (U lot ) | 1 - S I M, 



is 



(14) 



(15) 



provides a good estimate of the real distribution function of a 
Plummer sphere with mass segregation index S . 

The authors provide a numerical C-code plumix for gener- 
ating the cluster according to the algorithm described in their 
paper on the AlfA web page: http://www.astro.uni-bonn.de 

3. Results 

In this section we provide examples showing the much bet- 
ter performance of A4^ ST than A1^ ST . However, we will also 
compare with the more traditional method vV(^ F introduced in 
SectionQ] 



3.1. Special configurations 

First, we will demonstrate the power of vVf^ ST for some simple 
setups of artificial mass segregation as shown in Fig. [2] 

The idea to improve M^ ST and develop A^^ ST was in fact 
motivated by the goal to find a measure that reflects the optical 
impression of a lower or higher degree of mass segregation. The 
plots in Fig. |2] depict three different artificial configurations of 
the five most massive stars in a star cluster of Ik members, desig- 
nated "cross", "ring", and "clump", from top to bottom. All these 
configurations are characterised by identical Imst, i-e- according 
to A1^ ST they represent the same degree of mass segregation. 
However, from an observer's point of view the cross appears to 
show a lower degree of mass segregation than the ring (i.e. the 
latter appears more compact), while the clump would usually be 
interpreted as a highly segregated system with a peculiar outlier. 

Translated into a consistent algorithm we aim at measuring 
the compactness of a stellar system by assigning a higher weight 
to the dominant configuration of stars and hence to overc ome the 
"degeneracy" of M^ SJ . As already discussed in Section|2.l|this 
is mainly achieved by damping the contribution from "outliers", 
i.e. single edges with extreme lengths e, compared to the median 
of all edges, via the geometrical mean. Fig. [3] demonstrates the 
effect of vV(^ ST . While by construction Amst (black dashed line) 
is identical for all three configurations, F M st depends strongly 
on the degree of concentration of the dominant sample of stars. 
Using the five most massive stars for the calculation of Tmst the 
highly concentrated "clump" shows a roughly two times higher 
degree of mass segregation than the other two configurations. 
Also, it's significance (i.e. standard deviation in relation to the 
mean) is about 1.5 times higher than in the case of A M st- 

Note that though only the five most massive stars form a cen- 
trally concentrated configuration, i.e. are mass-segregated, one 
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Fig. 2. Three artificial configurations of massive stars with iden- 
tical Zmst in a model star cluster of Ik members. Top: "cross", 
Middle: "ring", Bottom: "clump". 
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Fig. 3. Comparison of Tmst (blue solid lines) and Amst (black 
dashed line) of three artificial configurations of massive stars in a 
model star cluster. From top to bottom, the solid lines with their 
symbols represent the artificial configuration "clump" (closed 
circles), "ring" (open circles), and "cross" (crosses). 



obtains a signature of mass segregation for a sample of up to 20 
most massive stars. In particular, Tmst of the ten most massive 
stars does even depend on the geometry of the configuration. 

3.2. Initially mass-segregated clusters 



Using the method of Subr et al. (2008) to create initially mass- 
segregated star clusters (see Section 2.2 1 we have generated a 
set of low- to high-mass star clusters with 100, Ik, 10k, and 
100k stars, respectively. Their underlying mass function from 



Kroupa (2001 1 in the range 0.08 - 150 M results in an 



erage stellar mass of ~O.6M . So our clusters span the mass 
range from 60 M G to 60k M which basically represents the 
entire observed cluster population in our Galaxy. All clusters 
have been set up with initial degrees of mass segregation S - 
{0.1,0.2,0.3,0.5,0.9}, each with ten configurations with differ- 
ent random seeds to reduce statistical uncertainties. However, in 
the following plots we show rescaled error bars that represent 
the statistical uncertainties of one single cluster. 

In Figs. |4] and [5] we show a selection of the most relevant 
results. On the left-hand side we compare F M st ("geometric") 
and Amst ("arithmetic") for the 5 (red), 10 (green), 20 (blue), 
50 (magenta), 100 (cyan), 200 (brown), 500 (orange), and 1000 
(black) most massive stars. The error bars and line thickness 
mark the lcr, 2<x, and 3cr uncertainties. The horizontal dotted 
lines mark integer values, the horizontal dashed line represents 
the unsegregated state r M sT = A M st = 1 ■ Star clusters of inter- 
mediate masses ~1000M o are well observed in our local neigh- 
bourhood. These objects are represented by our simulations with 
Ik particles. From Fig.Hlwe find that our measure of mass seg- 
regation, Fmst, detects a very low degree of mass segregation, 
S = 0.1, just above the lcr-level, nearly independent of the num- 
ber of most massive stars considered. The significance improves 
drastically for an intermediate degree of S — 0.3 to at least 3<x 
and even up to Act for the 20 to 100 most massive stars. Finally, 



for very strong mass segregation with S - 0.9 we obtain a very 
clear signature of more than 5<x for any number of (i.e. at least 
five) most massive stars. 

These results are very much better than using Amst which 
never approaches a significance of 3cr and - in particular - for 
S - 0.3 provides only a very weak lcr significance compared to 
3 - Act in the case of Tmst- 

On the right-hand side we plot the corresponding mass func- 
tions for comparison with the traditional method A1 MF to de- 
tect mass segregation. Here the mass function has been calcu- 
lated in annuli with a radius r - {l/Ar^, l/2rh,rh,r c ), where 
rh is the half-mass and r c the total radius of the star cluster. It 
is clearly demonstrated that only very strong mass segregation 
with S ~ 0.9 becomes evident via this method. In this case it is 
the mass range of stars around 10 M that provides the strongest 
signature. 

In contrast, our new measure Fmst is much more effective. 
In particular, the 10 to 20 most massive stars of a Ik star cluster 
usually provide the clearest signature of mass segregation. 

Qualitatively, we find the same results for more massive clus- 
ters of 10k stars in Fig. pj W( MST provides the best measure 
of mass segregation by far. However, there are some important 
quantitative differences. First, a much lower degree of mass seg- 
regation can be detected for more massive clusters, e.g. for 10k 
stars with S — 0. 1 a significance of 3cr is reached. Second, for a 
similar significance mass segregation becomes most evident for 
a larger number of most massive stars, e.g. the 20 most massive 
stars for a 10k system with S = 0.3 compared to the 10 most 
massive for a Ik system with S = 0.9. 

Both effects are in agreement with our expectations that in a 
more populous star cluster i) the same degree of mass segrega- 
tion S will result in a higher relative concentration of the same 
number of most massive stars, i.e. Fmst becomes larger, and ii) 
the absolute number of stars that show the same relative concen- 
tration is larger, i.e. AFmst becomes lower. 

The interplay of these two properties explains the existence 
and dependence of the local maximum of the significance of 
Tmst^ it peaks at the maximum ratio of Fmst and AFmst and thus 
increases for larger cluster masses and lower degrees of mass 
segregation towards larger numbers of most massive stars. 

From our entire set of initially mass-segregated clusters we 
find that a sample size of 10 to 20 most massive stars generally 
provides the clearest signature of mass segregation. 

Note that from Figs. Hand pi we conclude that using vVt MF 
the best estimate of mass segregation in star clusters from Ik to 
10k members would be for masses at ~10M Q in an annulus of 
radius r a l/Ar^. 

3.3. Mass-segregation in the ONC 

We recall that one of our goals was to develop a method that 
equally well applies to data from simulations and observations 
and thus demonstrate the excellent performance of vVt MST on ob- 
servational data of the ~1 Myr old Orion Nebula Cluster (ONC) 



obtained by Hillenbrand (1997). The sample contains 929 stars 
with mass estimates and so provides a robust test of the algo- 
rithm. 

The resulting plot of r MS T for the 5, 10, 20, 50, 100, 200, 
500 most massive, and all 929 stars in the upper part of Fig. [6] 
provides a clear signature of a significant concentration of the 
20 most massive stars, much stronger than given by Amst- 
Interestingly, our calculation of A M st shows slightly stronger 
mass segregation than published by Allison et al. (2009a). 



Compared to the initially mass-segregated cluster models with 
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Fig. 4. Diagnostics of initially mass segregated star clusters with Ik members following the prescription of Subr et al.| l |2008) . From 
top to bottom the degree of mass segregation, S , equals 0.1, 0.3, and 0.9 (see text for more details). On the left-hand side we compare 
Tmst and Amst for the 5, 10, 20, 50, 100, 200, 500, 1000 most massive stars. The error bars and line thickness mark the lcr, 2cr, 
and 3cr uncertainties. On the right-hand side we plot the corresponding mass function of the entire cluster population (solid black 
line), and the population within one (red tics), one-half (green crosses), and one-forth (blue dots) half-mass radius. The error bars 
mark the lcr uncertainties. 



Ik particles presented in Fig. [4] the plot of r M s T for the ONC 
shows a more complex distribution: there is a sharp drop be- 
tween the 10 and 20 most massive stars. The significance of 
the first two data points (>4c) corresponds to the model with 
S = 0.9, the <3cr significance of the other resembles the much 
less segregated models with S < 0.3. 

In summary, i) mass segregation in the ONC at the current 
age ~ 1 Myr is much stronger than estimated before, and ii) it is 
clearly detectable not only for the five most massive stars yet for 
the 20 most massive objects. 



However, the particular features of the F M s T distribution re- 
quire a more careful analysis. We do so by calculating Fmst of 
disjoint particle groups, i.e. the most massive 5, 6 to 10, 1 1 to 20, 
etc. stars. The corresponding plot at the bottom of Fig.|6]shows 
that it is only the five most massive stars that are really strongly 
mass segregated (with a significance >4<x). Their configuration 
is so dominant that adding the next 15 most massive stars - which 
are not segregated - still shows a strong signature at the 3<x level, 
similar to the results in Section |3~T1 
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Fig. 5. Diagnostics of initially mass segregated star clusters with 10k members following the prescription of Subr et al. ( 2008 1. From 
top to bottom the degree of mass segregation, S , equals 0.1 and 0.3 (see text for more details). Details of the plots are given in the 
caption of Fig. HI 



Hillenbrand ( 1997) argues that her sample of combined pho- 
tometric and spectroscopic data appears completely represen- 
tative of the ONC, showing in particular a uniform complete- 
ness with cluster radius. However, because observational data 
are usually biased by incompleteness this issue is addressed 
in Appendix [A] We demonstrate that incompleteness has only 
a moderate effect on A^^ ST and provide a simple prescription 
for recovering the original signature of mass segregation via the 
completeness function. 

We conclude that all but the five most massive ONC stars 
do not show any signs of mass segregation. This result is in ex- 
cellent agreement with Huff & Stahler (2006, Fig. 4). The five 
most massive stars' extraordinary degree of mass segregation 
resembles their peculiar tight trapezium-like configuration (e.g. 



McCaughrean & Stauffer|T994}|Hillenbrand & Hartmann|1998| l 



3.4. Dynamical evolution of mass-segregation 

As a last application of our new method A^^ ST we analyse again 
numerical data yet here we trace the dynamical evolution of a 
star cluster model over time. The initial configuration is based 
on our ONC model as described in previous publications (e.g. 
Olczak et al. 20 1 0) l . In short, it is a single star cluster with 
an isothermal initial density distribution, a Maxwellian veloc- 
ity distribution, the mass function of Kroupa (2001 ) in the range 
0.08 - 150 Mq, and an initial size of 2.5 pc. Note that we use 
a spherically symmetric model with a smooth density distribu- 
tion without any substructure. However, in contrast to our basic 



equilibrium model here our stellar system is initially collapsing, 
starting from a viria l ratio — 0.1 . The simulations were carried 
out with Nbody6 4 ( Aarseth 2003 ) until a physical age of 5 Myr, 
corresponding to 13.5 N-body time units of the cluster. 

In Fig. 17] we show the mean of three simulations of the same 
model with different random seeds used to generate individual 
positions, velocities, and masses. We plot F M st and A M st only 
for the 5, 10, 20, 50, and 500 most massive stars for reasons of 
visibility. The much better performance of our improved method 
Atw ST compared to A1^ ST is evident. Though both measures 
do show similar maximum degrees of mass segregation Amst 
is subject to large variations over time. Only for the five most 
massive stars does it show high significance over a longer period 
while for the 10 and 50 most massive stars even values lower 
than one are quite common. 

In contrast, F M s T clearly increases over time with the steep- 
est gradient within the first 2 Myr (or 5.5 N-body time units). It 
clearly separates the different mass groups, inversely correlated 
with the sample size. The much better robustness of r M sx reflects 
again its very efficient damping of "outliers". 

We note that mass segregation in a collapsing, intermediate- 
size stellar cluster of Ik stars can occur very quickly, i.e. within 
only a few crossing times. This finding demonstrates that rapid 
mass segregation (in terms of dynamical time scale) does not 
require substructure as has been recently investigated by Allison 
et al. (2009a i. However, a detailed investigation of dynamical 



4 Note that we have used the GPU enabled version of Nbody6 avail- 
able at http://www.ast.cam.ac.uk/research/nbody 
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Fig. 6. Diagnostics of mass segregation in the young Orion 
Nebula Cluster (ONC) using observational data from 
|Hillenbrand| ( fl997l >. Top: r MS T and A M st for the 5, 10, 20, 
50, 100, 200, 500 most massive, and all 929 stars (left to right). 
Bottom: T MST and A MST for the 5, 6 to 10, 11 to 20, 21 to 50, 
51 to 100, 101 to 200, 201 to 500, and 500 to 929 most massive 
stars (left to right). The error bars and line thickness mark the 
lcr, 2cr, and 3tr uncertainties. 



mass segregation in young star clusters will be presented in an 
upcoming publication. 



4. Conclusion and Discussion 



We have developed a new method, -M MST , to measure mass 
segregation by significantly improving a previous approach of 
Allison et al. (2009a I based on the minimum spanning tree 



(MST), here referred to as A1 MST . Their method uses the nor- 
malised length of the MST of a given sample of stars, Amst, as 
a measure of compactness. 

Compared to "classical" measures of mass segregation such 
as the slope of the (differential) mass function in different annuli 
around the cluster centre (see Section [TJ, Amst overcomes sev- 
eral substantial weaknesses. Our new method vVf MST inherits all 
the advantages provided by vVt MST - 

- independence of cluster geometry, 

- no requirement of cluster centre, 

- no requirement of quantitative measure of masses, 

- one-parametric unique measure of mass segregation - 

and adds three significant improvements: 
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Fig. 7. Dynamical mass segregation in a star cluster with Ik par- 
ticles and cold initial conditions (Q - 0.1). The plots show the 
mean Fmst (top) and Amst (bottom) of three simulations for the 
5 (red solid line), 10 (green long-dashed line), 20 (blue short- 
dashed line), 50 (magenta dotted line), and 500 (orange dot- 
dashed line) most massive stars over time. The filled regions 
indicate lcr uncertainties. 



- nearly linear computational efficiency, 

- robustness against peculiar configurations, and 

- high sensitivity. 

This gain is based on two boosting ingredients: 

1 . The implementation of an efficient 0(N log A0 algorithm for 
the calculation of the MST, and 

2. the calculation of the geometrical mean of the connecting 
edges, Fmst, instead of just their sum, Amst- 

The high performance is achieved by construction of a 
graph via 2D Delaunay triangulation (from the software package 
GEOMPACK: |Joe|1991| l, quick sorting of its edges, an d an effi- 
cient numerical implementation of Kruskal's algorithm (Kruskal 
1956). The advantage of the geometrical mean is that it damps 
the contribution of "outliers" very efficiently and thus traces the 
concentration of the dominant stars of a configuration. 

We have demonstrated the advantage of vVt MST compared to 
other known and frequently used methods via various examples, 
both for numerical and observational data. 



1. In general, using only the ten to twenty most massive stars 
Tmst provides a robust and sensitive measure of mass segre- 
gation for the entire population of star clusters in our Galaxy. 
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2. In particular, very low degrees of mass segregation can be 
detected in massive clusters like NGC 3603 that consist of 
10k or more stars. 

3. We have also confirmed the very strong mass segregation of 
the five most massive stars in the ONC as reported by Huff 



|& Stahlerld2006T ). 
4. Incompleteness of observational data has only a moderate 
effect on A1^ ST and corrections can be implemented easily 
similar to methods like vVt^p- 

However, we also stress three important aspects that have to 
be considered for a careful investigation of mass segregation via 

yvl MST' 

1. Our conclusion that the 10 to 20 most massive stars gener- 
ally provide the most sensitive measure of mass segregation 
has been derived from model clusters with smooth mass seg- 
regation over the entire sample. We caution that this is not to 
be expected if only specific subsamples are affected by mass 
segregation (as shown for the five most massive stars of the 
ONC in Section 3.3 i; this is of particular concern for studies 



of primordial mass segregation. We thus generally recom- 
mend to use both the cumulative and the differential analysis 
of a set of mass groups to investigate mass segregation in 
(young) star clusters. 

2. The signature of a small sub-sample can be strong enough to 
bias r M sx of a larger parent sample. To avoid wrong conclu- 
sions one should also calculate Fmst of disjoint samples, e.g. 
of the 5, 6 to 10, 1 1 to 20 etc. most massive stars. 

3. The value of F M st of a sub-sample is independent of its 
location with respect to the entire sample. It only reflects 
the compactness of this sub-sample, not its concentration to- 
wards some centre of the entire sample. So a large r M s T does 
not necessarily mean that the corresponding sub-sample is 
mass-segregated. 

Thus, in practice, in particular when dealing with observa- 
tional data, the determination of some centre of the sample as 
reference point is still required. However, the value of Tmst 
remains independent of the reference point and so its advan- 
tage compared to other methods. 

Our improved method A^^ ST will be vital for future studies 
of mass segregation in stellar systems. Its robustness and sensi- 
tivity is ideal for tracing lower degrees and more complex types 
of mass segregation than before. This is of particular interest for 
the earliest stages of star formation that seem to form protoclus- 
ters of complex structure (e.g. Teixeira et al. 2006). Furthermore, 
it will also help to investigate the question whether mass segre- 



gation is observed at all in young star clusters (e.g. Ascenso et al 



2009). Finally, our method can be used in general for precise 
structure analysis of any type of stellar system from planetary 
debris disks to galaxy clusters. 

As a first application of A1^ ST to numerical simulations we 
have demonstrated that mass segregation in young star clusters 
can occur very quickly under dynamically cold initial condi- 
tions. Indeed we do expect young stellar systems to form under 
such conditions as a result of the gravitational collapse of their 
parent molecular cloud. Whether collapse still occurs in largely 
evolved young clusters like the ONC - the scenario correspond- 
ing to our simulation - is still unknown yet became favoured 
recen t ly (e.g.|Jones & Walker|1988||Kroupa 2000; S cally et al. 
2005; jFuresz et al.||2008t |Proszkow et al.||2009T|Tobin et~ 
2009). However, at least some fraction of the oldest stars must 



have formed during global collapse. Hence rapid mass segrega- 
tion affects the structure of star forming regions at the earliest 



stages and could thus explain the observed mass segregation of 
the most massive stars in the ONC. 

We will investigate this and other aspects of dynamical mass 
segregation in young star clusters under various conditions in an 
upcoming publication. 
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Appendix A: Data incompleteness and T M st 

We provide an example on how incompleteness of observational 
data affects r M sx and present a simple prescription for recov- 
ering the original signature of mass segregation via the corre- 
sponding completeness function. 



C. Olczak et al.: A highly efficient measure of mass segregation in star clusters 



P 



«] 



rii[ 



2.5 1 0.5 5 0.01 150 1 2 
Table A.l. Set of parameters used for Eq. (|A.1|> in the present 



study. 
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Fig. A.l. Model of completeness as a function of stellar mass 
and radial position in the ONC according to Eq. dATh and us- 



ing the parameters from Table A. 1 From right to left the graphs 
represent increasing radii in steps of 0.25 pc from the centre to 
2.5 pc, the outer boundary of the ONC. The solid (red), dashed 
(blue), and dotted (black) lines are grouped in steps of 1 pc, re- 
spectively. 

Here we use again the original sample of 929 stars in the 
ONC from Hillenbrand ( 1997) to generate a reduced sample via 
convolution with an artificial completeness function with radial 
and mass dependence of the form 

c(r, r u , p, a\,a u , m, m\,m u ,b\, b u ) — 

1 - e^[ft(r,r u ,p,a x ,a u ) [M(m,m u m a )f (r ' r "' p ' hM ) , (A.l) 
where 



K{r, r u , p, q\,q u ) = (q u - qi)(r/r u ) p + q\ , 
log(tn/m{) 



Mini, m\, m u ) = 



log(ra/ra u ) ' 



(A.2) 
(A.3) 



and the indices / and u denote lower and upper values, respec- 
tively. 

The parameters used to simulate loss of data due to obser- 



vational incompleteness are presented in Table A.l The corre 



sponding plot in Fig. |A.1| shows the completeness as a function 
of stellar mass in dependence of radial position in the cluster. 
From right to left the graphs represent increasing radii in steps 
of 0.25 pc from the centre to 2.5 pc, the outer boundary of the 
ONC. The solid (red), dashed (blue), and dotted (black) lines are 
grouped in steps of 1 pc, respectively. 

The functional form has been adopted such that it resembles 
observational incompleteness due to crowding in the dense and 
bright cluster centre and due to individual extinction over the 
entire cluster. It is expected that stars with masses > 10M o are 
too bright to be missed while low-mass objects close to the clus- 
ter centre with it's four luminous Trapezium stars remain mostly 
undetected. 

To reconstruct a larger unbiased sample from the incomplete 
sample we have used the inverse completeness function in the 
following simple prescription: 



1. Go to the next object i in the incomplete sample. 

2. Calculate its completeness c,- using its radial position r, and 
mass nij. 

J. IX Cj > C m i n . 

(a) Calculate the missing number of objects n, = 1 /c, - 1 
with r, and m,. 

(b) Generate for each additional object j its 

i. mass nij - nij(l + A), A e [-0.05,0.05] a random 

number, and 
ii. radial position Tj - r,-(l + rf), rj e [-0.1,0.1] a ran- 
dom number. 

In short, the parents' individual completeness c, determines 
the number of "clones" n, with slightly varying properties. The 
threshold c m ; n is used to prevent massive cloning of rare objects 
that could introduce a significant bias. We found that a surpris- 
ingly low value of c m j n = 0.2 is save. 

We compare F M st of the three samples - original, incom- 
plete, and reconstructed - in Fig. A.2 The left-hand side shows 



Tmst of the 5, 10, 20, 50, 100, 200, 500 most massive, and all 
stars, i.e. cumulative mass groups; the right-hand side shows 
Fmst of the 5, 6 to 10, 11 to 20, 21 to 50, 51 to 100, 101 to 
200, 201 to 500, and 500 to all most massive stars, i.e. differ- 
ential mass groups. The top plot refers to the original sample of 
929 ONC stars: it is the same data as on the left-hand side of 
Fig. [6] Below we plot Tmst of the incomplete sample that has 
been artificially reduced to 485 stars via Eq. ( |A.1[ ). This removal 
of nearly half the cluster population - preferentially low-mass 
stars in the cluster centre - has a marginal effect on Tmst: only 
for the five and ten most massive stars a significant difference 
is observed. However, the differential plot reveals again a dom- 
inant contribution from the five most massive stars, Tmst of the 
6-10 most massive remains nearly unchanged. 

The bottom plot contains data of additional 345 cloned ob- 
jects and so 830 stars in total - about 10 % less than the original 
sample. The reconstruction procedure seems to work fairly well: 
Tmst of the five and ten most massive stars is indistinguishable 
from its original values. However, the next intervals up to the 
200 most massive stars all show slightly lower degrees of mass 
segregation than originally. The reason is that the sample of even 
less massive stars (200-500 and 500-830 most massive stars) is 
marginally mass segregated (i.e. r M sx > 1 unlike in the original 
sample): these low-mass stars that make up the main part of the 
random reference sample were cloned preferentially closer to the 
cluster centre which reduced Tmst of the more massive stars. 

However, considering the reconstruction of up to 80 % indi- 
vidual incompleteness and nearly half of the entire incomplete 
sample, the result is very promising: the strong conclusion for 
the original sample that the five most massive stars in the ONC 
are highly mass segregated while all other stars do not show any 
signature of mass segregation remains unchanged for the recon- 
structed sample. 
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Fig. A.2. Diagnostics of mass segregation for three sets of ob- 
servational data of the young Orion Nebula Cluster (ONC). The 
left-hand side shows from left to right Tmst of the 5, 10, 20, 
50, 100, 200, 500 most massive, and all stars, i.e. cumulative 
mass groups; the right-hand side shows from left to right Tmst 
of the 5, 6 to 10, 11 to 20, 21 to 50, 51 to 100, 101 to 200, 
201 to 500, and 500 to all most massive stars, i.e. differen- 
tial mass groups. The error bars and line thickness mark the 
lcr, 2cr, and 3<x uncertainties. Top: Original set of 929 stars 
observed by Hillenbrand ( 1997 1. Middle: Incomplete sample. 



Bottom: Reconstructed sample. 
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